Vision-Language Pre-Trained Models



Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models

Neural Information Processing Systems

Recent endeavors mainly focus on parameter efficient transfer learning (PETL) for vision-language pre-trained (VLP) models, updating only a small number of parameters. In this paper, we aim at parameter and computation efficient transfer learning (PCETL) for VLP models. In particular, PCETL not only needs to limit the number of trainable parameters in VLP models, but also to reduce the computational redundancy during inference, thus enabling a more efficient transfer. To approach this target, we propose a novel dynamic architecture skipping (DAS) approach towards effective PCETL. Instead of directly optimizing the intrinsic architecture of VLP models, DAS first observes the significance of each module to the downstream task via a reinforcement learning (RL) based process, and then skips the redundant ones with lightweight networks, i.e., adapters, according to the obtained rewards.
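To make the abstract's two ingredients concrete, the following is a minimal sketch (hypothetical names, not the authors' released code) of the idea: a REINFORCE-style policy samples which backbone blocks to skip, skipped blocks are replaced by lightweight bottleneck adapters, and the reward (e.g., validation accuracy minus a compute penalty) updates the per-block skip probabilities.

```python
# Sketch of DAS-style architecture skipping; names and reward design are
# illustrative assumptions, not the paper's exact formulation.
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Residual bottleneck adapter that stands in for a skipped block."""
    def __init__(self, dim: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)

    def forward(self, x):
        return x + self.up(torch.relu(self.down(x)))

class SkipPolicy(nn.Module):
    """One Bernoulli skip logit per block, trained with REINFORCE."""
    def __init__(self, num_blocks: int):
        super().__init__()
        self.logits = nn.Parameter(torch.zeros(num_blocks))

    def sample(self):
        probs = torch.sigmoid(self.logits)
        mask = torch.bernoulli(probs)  # 1 = skip this block
        log_prob = (mask * probs.clamp_min(1e-8).log()
                    + (1 - mask) * (1 - probs).clamp_min(1e-8).log()).sum()
        return mask, log_prob

def forward_with_skips(blocks, adapters, mask, x):
    """Run the backbone, routing skipped blocks through their adapters."""
    for block, adapter, skip in zip(blocks, adapters, mask):
        x = adapter(x) if skip > 0.5 else block(x)
    return x

def policy_step(optimizer, log_prob, reward, baseline):
    """REINFORCE update: push skip probabilities toward high-reward masks."""
    loss = -(reward - baseline) * log_prob
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

Only the policy logits and the adapters are trained here, which is what keeps the approach within the PCETL budget: the VLP backbone stays frozen while inference cost drops with every block the policy learns to skip.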


Multimodal Search on Iconclass using Vision-Language Pre-Trained Models

Santini, Cristian, Posthumus, Etienne, Tan, Mary Ann, Bruns, Oleksandra, Tietz, Tabea, Sack, Harald

arXiv.org Artificial Intelligence

Terminology sources, such as controlled vocabularies, thesauri and classification systems, play a key role in digitizing cultural heritage. However, Information Retrieval (IR) systems that allow users to query and explore these lexical resources often lack an adequate representation of the semantics behind the user's search, which can be conveyed through multiple expression modalities (e.g., images, keywords or textual descriptions). This paper presents the implementation of a new search engine for one of the most widely used iconography classification systems, Iconclass. The novelty of this system is the use of a pre-trained vision-language model, namely CLIP, to retrieve and explore Iconclass concepts using visual or textual queries.
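As an illustration of this retrieval scheme, the following is a minimal sketch using the Hugging Face transformers CLIP API; the Iconclass entries below are placeholder examples, and the paper's actual index, model variant, and preprocessing may differ.

```python
# Sketch of CLIP-based multimodal retrieval over concept descriptions.
# The concept list is a stand-in for a real Iconclass index.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

# Index: embed the textual descriptions of the concepts once, up front.
concepts = [
    "25F23(LION): lion",
    "11H(FRANCIS): St. Francis of Assisi",
    "71E: story of Moses",
]
with torch.no_grad():
    text_inputs = processor(text=concepts, return_tensors="pt", padding=True)
    concept_emb = model.get_text_features(**text_inputs)
    concept_emb = concept_emb / concept_emb.norm(dim=-1, keepdim=True)

def _rank(q_emb, top_k):
    """Rank indexed concepts by cosine similarity to a query embedding."""
    q_emb = q_emb / q_emb.norm(dim=-1, keepdim=True)
    scores = (q_emb @ concept_emb.T).squeeze(0)
    best = scores.topk(min(top_k, len(concepts)))
    return [(concepts[i], s) for i, s in
            zip(best.indices.tolist(), best.values.tolist())]

def search_by_text(query: str, top_k: int = 3):
    """Textual query: embed with the CLIP text encoder."""
    with torch.no_grad():
        inputs = processor(text=[query], return_tensors="pt", padding=True)
        return _rank(model.get_text_features(**inputs), top_k)

def search_by_image(path: str, top_k: int = 3):
    """Visual query: embed with the CLIP image encoder."""
    with torch.no_grad():
        inputs = processor(images=Image.open(path), return_tensors="pt")
        return _rank(model.get_image_features(**inputs), top_k)
```

Because text and images land in the same embedding space, one precomputed concept index serves both query modalities; at query time only the query itself needs to be encoded.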